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Abstract: Natural resource statistics are often unavailable for small ecological or economic regions and policymakers have to rely on 
state-level datasets to evaluate the status of their resources (i.e., forests, rangelands, grasslands, agriculture, etc.) at the regional or local 
level. These resources can be evaluated using small-area estimation techniques. However, it is unknown which small area technique pro¬ 
duces the most valid and precise results. The reliability and accuracy of two methods, synthetic and regression estimators, used in small- 
area analyses, were examined in this study. The two small-area analysis methods were applied to data from Jalisco’s state-wide natural re¬ 
source inventory to examine how well each technique predicted selected characteristics of forest stand structure. The regression method 
produced the most valid and precise estimates of forest stand characteristics at multiple geographical scales. Therefore, state and local re¬ 
source managers should utilize the regression method unless appropriate auxiliary information is not available. 

Key words: forest structure; regression estimator; synthetic estimator; spatial model; stratified random sampling; satellite imagery; inven¬ 
tory and monitoring 


Introduction 

Health of ecosystems is a matter of national security for most 
industrial economies. Yet conventional approaches to land re¬ 
source inventory and monitoring are being significantly chal¬ 
lenged by new scientific needs and social environmental con¬ 
cerns. One solid example of this is the State of Jalisco’s ecosys¬ 
tem resource inventory and monitoring program, known in Mex¬ 
ico under the acronym of IMRENAT (Flores-Garnica et al. 2007). 
From the beginning, policy makers and stake holders in Jalisco 
called for an inventory and monitoring program that was respon¬ 
sive to the spatiotemporal dependencies of natural resources as 
influenced by natural and human processes. In addition, the de¬ 
sign not only focused on classical statistical population parame¬ 
ters (Schreuder et al. 1993) for commercially valuable resources, 
but also had to account for estimates of other important ecologi¬ 
cal indicators that are fundamental to support decision making 
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processes for ecosystem sustainability at multiple geographical 
and organizational scales. Consequently, IMRENAT was de¬ 
signed to meet these needs and concerns. IMRENAT was im¬ 
plemented in the year 2006; and results presented in early sum¬ 
mer of 2007 (SEDER-FIPRODEFO 2007). Design specifications 
and assessment of its statistical performance have been reported 
(Reich et al. 2008b) for key land resource variables (e.g. canopy 
closure, basal area, cubic volume of commercial trees, and forest 
tree biomass) across bioclimatic regions. 

Federal and state funds in Mexico have been used to create 
many new environmental and land resource management initia¬ 
tives. Stakeholders want to know the current status of the natural 
resources in their region, county, watershed, and land units. 
However, current sources of data do not provide information at 
these organizational scales. At the state level IMRENAT is the 
primary source of information as it provides annual estimates of 
the status of natural resources (i.e., forests, rangelands, grass¬ 
lands, agriculture, etc.). Estimates of these and other land re¬ 
source variables are needed for several levels of geopolitical and 
ecological administrative units, what in Mexico is termed Terri¬ 
tory Land Resource Management (Ordenamiento Territorial). 
The demand for information at the local level has greatly in¬ 
creased, and producing local area statistics has emerged as one of 
the state’s most difficult and pressing statistical problem. SE¬ 
DER-FIPRODEFO is responsible for providing statistics on the 
status of the natural resources for use in ecological sustainability 
research, resource management and planning and assessments of 
land resources to meet regional and local economic needs. 

IMRENAT was designed to provide reliable estimates at the 
state level. Even within the broad climatic regions within the 
state, sample sizes are large enough to produce direct estimates 
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with a reasonable level of precision. However, for many small 
geographical regions within the state, sample sizes are often too 
small to produce reliable estimates. When a small area contains 
sample data, this pennits the use of models with specific small 
area effects, allowing for more accurate estimates of the parame¬ 
ters of interest at the small area level (Fay and Herriot 1979; 
Gosh and Rao 1994; Rao 1999; Pfeffermann 1999; Lehtonen and 
Pahkinen 2004). If no sample data is available, indirect estima¬ 
tors, such as synthetic estimators, are required to produce reliable 
estimates (Rao 2003; Ghosh and Rao 1994). The technique of 
synthetic estimation involves applying national or regional esti¬ 
mates of the characteristic being measured for specific popula¬ 
tion subgroups to the small area's population composition (Laake 
1978). The simplest form of synthetic estimation requires com¬ 
putation of a weighted average of the mean values of the charac¬ 
teristics in the subgroups with weights that are proportional to 
the distribution of the subgroups in the small-area population. A 
more general approach involves regression analysis. In this ap¬ 
proach, national or regional data are used to estimate a regression 
equation which relates the independent variables, which define 
the population subgroups, to the characteristic of interest. The 
values of the regression variables for the small area are then used 
in the equation to obtain estimates of the characteristic for that 
small area. 

This paper briefly describes two techniques which can be used 
to produce small area estimates of forest structure. An overview 
of IMRENAT, which focuses on aspects of the survey design 
that, in turn, affect small area estimation will be presented. Each 
of the techniques is then used with data from IMRENAT to pro¬ 
duce estimates of selected forest stand characteristics for several 
geopolitical regions in the state. Strengths and weaknesses of the 
techniques are discussed in the context of choosing the best ap¬ 
proach for the small area system. 

Methods 


Notation 

Consider a finite population Q = (l,...,7V) consisting of N 

identifiable units. A stratification variable J is used to subdivide 
the population into H strata with N, units in stratum h from 

which a simple random sample of II h units are selected without 

Z H 

I!,, and popula- 
h =1 n 


tion size 


n = Y"n 

1 ‘ 


Associated with the ith unit of the 


Mi stratum are two variables y hi , the variable of interest and AT -, 
a covariate that is available for all units in the population. For the 
Mi stratum let W h — N h /N be the stratum weight, 

fh = n h / N h the sampling fraction, V h , X h , Y h , X h the y- 
and x- sample and population means respectively. As¬ 
sume X = W h X h is known. Let = (l,.. .,D) 
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represent a small area population consisting of 
Nj units 


(u?„ R, = «;;E>, = v) 


which are mem- 


' d -\'— cl —i —a -—> / j ,j-\ - 

bers of the K < H strata each consisting of N dll units 

K<H 

with stratum 


weights W dh = N dh / N d and population mean X t 


Synthetic estimator 

The synthetic estimate for a small area £l d is the sum of the 
weighted average of the stratum means, y, across all strata 
within the small area, where the weight is the proportion of the 
population of the small area that is in each stratum. That is, 

K<H 

y d = Yj w dhy h (d 

h =1 


with estimated variance 


„ / x K<H 



— 

h =1 

V. H h J 


2 

where S h is the sample variance for stratum h. The appropriate¬ 
ness of this approach inherently depends upon the extent to 
which the criterion used to stratify the population is related to the 
parameter of interest. 

Regression estimator 

The regression estimator is based on a regression equation using 
a predictor variable as the independent variable and sample data 
for the variable of interest as the dependent variable. Estimates 
of the dependent variable are computed at the stratum level from 
the field data. Using the population means of the independent 
variable for each stratum a regression equation to predict y, is: 

y h = a + bX h . (3) 

To adjust the relative importance of any one stratum ( h ) to the 
overall mean and to reduce any bias in survey estimates due to 
local differences in the response variable, the stratum means 
were multiplied by the stratum weights 

( z h = W h y h , v h = W h X h ) prior to fitting the regression 

equation (DuMouchel and Duncan 1983). The weighted stratum 
means were used to define the regression estimator (Cochran 
1977): 

z d h=z + b{v dh -v)- (4) 

To obtain an estimate for the small area Cl d , estimates from the 
regression equation are obtained for each stratum and summed 
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V dh -Hv 


( 5 ) 


H 

Letting Hz =^ J W h y h =y s , and 

h =1 
H 

Hv — JV h X h — X the small area estimator 

h =1 

H H 

^j Z dh ^^j^dh^h = yd is obtained b y substitut- 
h =1 h =1 

H H 

ingX^for^V^ = 'YjW dh X dh , giving 

/i=l A=1 


Xy = X, + b ( X d ~ X )- 


( 6 ) 


Thus, the estimate of the population parameter for the small area 
is equal to the estimated population mean ( y st ) from the 

state-wide inventory adjusted for differences between the popu¬ 
lation means of the independent variable at the small area and 
state level. 

The prediction variance for stratum h in the small-area of in¬ 
terest, El d is given by (Cochran 1977) 


V{zJ = MSE 


i+—+ 


H 


I(v*-v ) 2 

h =1 J 


(V) 


where 


MSE 


f H 


{n-2){pi 


h \ 

_\2 


E( z a _z ) 2 _& 2 E( v a _ v ) 


( 8 ) 


h=\ J 


Summing across all K < L strata the prediction variance 
for y d becomes 


V 


fe) : 


MSE 


K + — + ^ 


K<H 

Z(v.-v ) 2 


H 


X(vy,-V) 2 


h =1 


1 — — 
where V = — ^W h X h , V dh = W dh X dh and 

H h=\ 


( 9 ) 


v h is as de¬ 


fined previously. 

The motivation for use of this type of estimator is that if the 
set of independent variables are easily obtainable for the small 
area and if the relationship between the dependent and independ¬ 


ent variables is strong, then estimates of good quality might be 
produced at relatively low cost. 

Example 

We compare the two estimators (i.e., synthetic and regression) 
for forest stand characteristics and their estimated variances for 
12 economic regions in the State of Jalisco; the eight counties in 
the Sierra Ocidente economic region in western Jalisco as well as 
for two small watersheds and a land tenure unit within this eco¬ 
nomic region (Fig. 1). 



Fig. 1 Location of twelve economic regions in the State of Jalisco, 
Mexico; eight counties in the Sierra Ocidente economic region in 
western Jalisco and two small watersheds and a land tenure unit 
within this economic region. 

A two-way nested design was used to stratify the state for the 
purpose of the inventory and monitoring program (IMRENAT). 
The state was first stratified by climatic zones (Reich et al. 2008a) 
which corresponded to broad vegetation types (i.e., semi-arid, 
temperate and tropical) in the state. Each of the three climatic 
zones was further stratified into forested and non-forested areas 
resulting in a total of six strata. Nested within each of the six 
strata, satellite imagery (Landsat-7 ETM+) was used to identify 
ten spectral classes to account for the variability in the vegetation 
cover, resulting in a total of H = 60 strata. A total of n = 1442, 30 
m x 30 m primary sampling units were randomly allocated to 
strata based on the economic importance of the region and 
whether it was forested or non-forested (Table 1). 

Variables selected for analysis included percent canopy clo¬ 
sure, basal area (nr-ha -1 ), cubic volume (m 3 -ha _1 ) and forest tree 
biomass (tones-ha -1 ). Trees (>12.5 cm DBH) were measured for 
diameter at breast height (DBH) and total tree height (m) and 
recorded by species on each sample plot. Tree diameters were 
used to estimate basal area on each sample plot. A spherical den- 
siometer was used to estimate the average percent canopy clo¬ 
sure on each sample plot. Tree diameters and total tree height 
were used to estimate cubic volumes and above ground biomass 
using regression equations provided by the State. Biomass was 
estimated for all trees, while volumes were only estimated for 
commercial species. Volumes for non-commercial species were 
set to zero. 
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Table 1. Distribution of sample plots in the network of permanent 
plots in Jalisco, Mexico by bioclimatic zone, the forest/non-forest 
stratum, and spectral class. 



Semi-Arid 

Temperate 

Tropical 

Spectral 

Class 

Non- 

forested 

Forested 

Non- 

forested 

Forested 

Non- 

forested 

Forested 

i 

9 

16 

0 

27 

i 

22 

2 

5 

50 

12 

44 

15 

34 

3 

8 

28 

32 

40 

7 

25 

4 

4 

35 

24 

58 

8 

31 

5 

9 

24 

29 

78 

16 

28 

6 

6 

26 

20 

40 

11 

41 

7 

3 

19 

38 

33 

10 

28 

8 

11 

30 

11 

66 

14 

34 

9 

8 

20 

24 

64 

14 

53 

10 

2 

16 

8 

42 

7 

24 

Total 

65 

264 

198 

492 

103 

320 





w 



w 


The small area regression estimator requires the availability of 
an independent variable that is correlated to the variable of inter¬ 
est and is available for all sample units in the population to allow 
the computation of population means for individual stratum for 
the population and small area. This infonnation was provided in 
the form of a set of spatial models developed as part of IMRE- 
NAT (Reich et al. 2008b). The models were used to predict the 
spatial distribution of previously selected variables throughout 
the state at the same spatial resolution as the field data and satel¬ 
lite imagery used to stratify the state (Fig. 2). Details of the mod¬ 
eling process can be found in Reich et al. (2008b). The modeling 
process involved using multiple linear regressions to describe the 
large-scale spatial variability, while a tree-based stratified design 
was used to describe the small-scale variability associated with 
site-specific variability in forest stand structure. Independent 
variables used in the models included various Landsat-7 ETM+ 
bands, climatic and topographic data. The fitted models de¬ 
scribed 63% of the observed variability in canopy closure, 62% 
for basal area, 79% for volume and 82% for biomass used in 
their study. 




Fig. 2 Spatial distribution of a) canopy closure (%), b) tree basal area (nr-ha 1 ), c) cubic volumes (m 3 -ha ') of commercially important tree spe¬ 
cies and d) above ground tree biomass (tonnes-ha 1 ) in the State of Jalisco, Mexico. 


Within each of the H= 60 strata, estimates of the mean and 
sample variance were calculated for each variable from the field 
data. Stratum means from the spatial estimates ( X, ) were ob¬ 
tained by averaging these estimates over all N h sample units 
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within a stratum. Stratum sizes were defined as the number of 
sample units associated with each stratum. This resulted in 60 
paired sets of stratum means from the 60 strata for each depend¬ 
ent variable (i.e., field data) and independent variable (i.e., spa- 
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tial models). The stratum means were then weighted by their 
stratum weights. Ordinary least squares was then used to esti¬ 
mate the parameters of the individual regression equations relat¬ 
ing the weighted stratum means from the spatial models to the 
weighted stratum means from the state-wide inventory. For each 
of the small areas identified in Fig. 1, estimates of the mean and 
standard errors were obtained using the synthetic and regression 
estimators. Estimates were compared with respect to their preci¬ 
sion and a two-sample t-test was used to test for significant dif¬ 
ferences in estimated means. Small area estimates, when aggre¬ 
gated over all small areas, were further evaluated on their consis¬ 
tency with state level estimates. 

Results 

Figure 3 shows the standardized stratum means for canopy clo¬ 
sure, basal area, volume and biomass for the 60 strata in the state, 
along with the stratum weights. Stratum means were standard¬ 
ized by subtracting the median value from the individual stratum 
means and dividing by the range in stratum means. The figure 
shows the relationship between the stratum means for the forest 
characteristics and the spectral classes used to stratify the state. 
Since all four variables are correlated they exhibited similar 
trends within strata. Direct estimates of the forest stand charac¬ 
teristics for the three climatic regions are provided in Table 2 
along with their estimated variances and percent sampling errors. 


Table 2. Direct estimates of forest stand characteristics for the three 
bioclimatic regions in the State of Jalisco, Mexico. 


Stand Characteristic 

Sample 

Size 

Min. 

Mean 

Max. 

Variance 

of Mean 

%SE 



Semi-Arid (Area = 1,614,167 ha) 


Canopy Closure (%) 

329 

0 

21.3 

97 

0.89 

8.8 

Basal Area (m 2 -ha‘ 1 ) 


0 

3.6 

107 

0.18 

23.8 

Volume (m 3 -ha’ 1 ) 


0 

19.9 

877 

11.5 

34.0 

Biomass (tonnes-ha* 1 ) 


0 

29.7 

1776 

41.0 

43.2 



Temperate (Area = 4,738,465 ha) 


Canopy Closure (%) 

690 

0 

34.8 

100 

0.85 

5.3 

Basal Area (m 2 ha* 1 ) 


0 

5.0 

59 

0.05 

8.7 

Volume (m 3 -ha _1 ) 


0 

29.8 

658 

3.1 

11.8 

Biomass (tonnes-ha* 1 ) 


0 

36.4 

483 

3.0 

9.6 



Tropical (Area = 1,493,114 ha) 


Canopy Closure (%) 

423 

0 

45.5 

100 

2.1 

6.4 

Basal Area (m 2 -ha* 1 ) 


0 

9.0 

142 

0.33 

12.8 

Volume (m 3 -ha‘ ! ) 


0 

50.5 

1275 

18.2 

16.9 

Biomass (tonnes-ha* 1 ) 


0 

63.3 

749 

17.2 

13.1 


The relationship between the weighted stratum means ob¬ 
tained from the spatial models and the state-wide inventory for 
the selected variables are displayed in Fig. 4 with fit statistics in 
Table 3. Except for the volume equation the estimated slopes did 
not differ significantly from 1 at the 0.05 level of significance. 
Also, the estimated intercepts did not differ significantly from 0 
at the 0.05 level of significance. These results suggest a close 
agreement between the weighted stratum means from the spatial 
models and the field data. Histograms of the residuals and scatter 


plots of the residuals versus the fitted values showed no gross 
deviations from the assumptions of the linear model. 


: Semi-Arid 

A Temperate 

1. Tropical 

' W |Non-Forest 

rX a i 

/ VVi 




Forest \ y* 

Forest jNbn-ToresV 

1 Forest j rJon-Forest 


_0 4 I .. ■ . I ..... I i . . .... I 

<« ' t 5 9 13 17 21 25 29 33 3 7 41 45 49 53 57 61 

Strata 



Fig. 3 Standardized stratum means for canopy closure (%), basal 
area (mMia 1 ), cubic volumes (m Mia 1 ) of commercially important 
tree species, and above ground tree biomass (tonnes-ha 1 ) for the 60 
strata in the state, along with the stratum weights. Stratum means 
were standardized by subtracting the median value from the individual 
stratum means and dividing by the range in stratum means. 



Fig. 4 Scatter plots of the relationships between the weighted stra¬ 
tum means from the state-wide inventory and the weighted stratum 
means from the spatial models for (a) canopy closure (%), (b) tree 
basal area (m 2 -ha ‘), (c) cubic volumes (m 2 -ha"') of commercially 
important species and (d) above ground tree biomass (tonnes/ha) in 
the state of Jalisco, Mexico. The solid lines are the regression lines for 
the individual variables. 

A comparison of the two estimators in estimating basal area 
(nT-ha" 1 ) in the 12 economic regions is summarized in Table 4. 
The table contains estimates of the mean, variances and percent 
sampling errors for each economic region. Also included in the 
tables is the ratio of the variance for the synthetic estimator to the 
variance of the regression estimator. This ratio will be greater 
than one when the regression estimator is more precise than the 
synthetic estimator. A t-statistic for testing the null hypothesis of 
no difference in the estimated means is also provided. 
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Table 3. Estimated regression coefficients relating the weighted stratum means obtained from the spatial models to the weighted stratum means 
obtained from the state-wide inventory for the L=60 strata. 

Model 

n 

A 

se{fi Q ) 

A 

se{(3 x ) 

R 2 

p-value 1 

Canopy Closure (%) 

60 

0.014 

0.016 

0.988 

0.017 

0.982 

0.490 

Basal Area (m 2 /ha) 

60 

0.002 

0.003 

1.018 

0.025 

0.965 

0.469 

Volume (m 3 /ha) 

60 

-0.066 

0.035 

1.144 

0.035 

0.928 

<0.001 

Biomass (tones/ha) 

60 

0.006 

0.039 

1.078 

0.040 

0.926 

0.053 


1 p-value testing the null hypothesis that the slope of the regression equation is equal to one 


Table 4. Comparisons of the synthetic and regression estimators for estimating basal area (nT-ha 1 ) at the regional level in the State of Jalisco, 
Mexico. 




Synthetic Estimator 

Regression Estimator 



Area 

Name 

Mean Variance %SE 

Mean Variance %SE 

Rel.Eff. 1 

t-Statistic 1 


i 

Norte 

4.8 

0.11 

14.1 

4.2 

0.03 

8.3 

3.8 

1.59 

2 

Altos Norte 

1.9 

0.02 

15.7 

1.3 

0.02 

22.5 

1.2 

3.27* 

3 

Altos Sur 

2.3 

0.02 

11.6 

1.5 

0.02 

18.6 

0.8 

3.77* 

4 

Cienega 

2.2 

0.01 

11.4 

1.5 

0.02 

15.2 

0.8 

3.53* 

5 

Sureste 

7.8 

0.08 

7.5 

7.6 

0.03 

4.7 

2.6 

0.38 

6 

Sur 

6.8 

0.10 

9.4 

6.8 

0.03 

5.3 

3.2 

0.16 

7 

Sierra de Amula 

4.6 

0.04 

8.2 

4.8 

0.03 

7.3 

1.2 

-0.56 

8 

Costa Sur 

9.2 

0.20 

9.7 

9.5 

0.03 

3.7 

6.2 

-0.80 

9 

Costa Norte 

9.1 

0.34 

12.8 

8.7 

0.03 

4.0 

11.3 

0.51 

10 

Sierra Ocidente 

8.8 

0.13 

8.2 

10.7 

0.03 

3.4 

3.9 

-4.84* 

11 

Valles 

4.2 

0.04 

8.9 

4.1 

0.03 

8.4 

1.2 

0.49 

12 

Centro 

3.0 

0.02 

10.0 

2.7 

0.03 

12.7 

0.8 

1.27 


Ratio of the variance of the synthetic estimator to the variance of the regression estimator. 

^ Two-sample t-test for differences in the estimated population means. * = significant at the 0.05 level. 


The synthetic estimator had a tendency to overestimate in the 
semi-arid region (regions: 2, 3, 4, 12) and underestimate in the 
tropical region (regions: 8, 9) relative to the regression estimator. 
This trend was consistent with the expectations of managers 
familiar with the forest resources in these economic regions. 
Significant differences between the two sets of estimates oc¬ 
curred primarily in the semi-arid region. More significant differ¬ 
ences were observed between estimates of canopy closure and 
volume than estimates for basal area and biomass. The regression 
estimator was more precise in estimating canopy closure across 
all economic regions. In the semi-arid region the synthetic esti¬ 
mator was more precise in estimating basal area, volume and 
biomass. The synthetic estimator was also more precise than the 
direct estimator in the semi-arid region (Table 2). Both the direct 
and synthetic estimators had similar levels of precision in the 
temperate and tropical regions. 

To further evaluate the regression estimator it was used to es¬ 
timate the forest characteristics for the eight counties in the Si¬ 
erra Ocidente economic region in western Jalisco and two small 
watersheds and a land tenure unit within this economic region. 
The results for cubic volumes (m 3 'ha _1 ) are summarized in Table 
5. Included in the table are the size of each region, estimates of 
the mean, variances, lower and upper 0.95 confidence bounds 
and percent sampling error. The important thing to note is the 
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consistency between the estimates of the mean and variances 
when aggregated from the economic regions up to the state level 
and from the county level up to the economic region. One 
should also note the consistency of estimates from the state-wide 
inventory with the spatial models at the state level. Similar re¬ 
sults were obtained for the other variables. 

Discussion 

This study evaluated the use of the synthetic and regression 
method for estimating forest stand characteristics for small geo¬ 
graphical regions in the State of Jalisco, Mexico. The synthetic 
estimator can be easily applied and tends to be widely accepted 
due to its ease of use. However, this method will be biased if the 
model assumptions leading to the estimator are not satisfied, and 
the magnitude of this bias will likely vary with each application. 
The ability of the synthetic estimator to derive relatively stable 
estimates for small areas depends upon the extent to which the 
stratification scheme accounts for the local variability in the 
parameter of interest. In this study, this was addressed in the way 
the state-wide inventory was designed. The state was stratified 
based on the climatic variability and the spectral properties of 
forested and non-forested land cover types. Thus, the state shares 
characteristics with small areas through the spectral properties 
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used to stratify the state and the fact that estimates from the state- an overestimation. But if the small area consists of high values, 

wide inventory are available for all strata. their estimated means will appear to decrease relative to the 

In spite of the effort to account for the variability in forest population. In the semi-arid region where the precipitation gradi- 

characteristics through the stratification scheme some biases are ent is more pronounced, small areas within this region can be 

evident. The differences are to some extent related to a precipita- very extreme relative to the population; their means are the fur- 

tion gradient that runs east to west through the state (Fig. 2). thest from the population and subject to larger biases. In the 

Within a given climatic region changes in precipitation patterns tropical region, the precipitation gradient is less pronounced and 

can cause differences in the response variable. Assume that the small areas within this region differ from the population mean by 

distribution of the variable of interest is mound shaped and only a little bit and the bias is inconsequential. This is evident 

where the left tail represents the drier portion of the region and when one compares the direct estimates for the three climatic 

the right tail, the wetter portion of the region. If the small area is regions with estimates for the small geographical regions that 

exclusively selected from the left tail of the distribution the esti- make up these regions, 

mated mean will appear to approach the overall mean resulting in 

Table 5. Small-area estimates of cubic volume at the regional, county, watershed and forest level in the State of Jalisco, Mexico. 

Area 

Name 

Area (ha) 

Volume (m 3 /ha) Variance 0.95 Lower Bound 

0.95 Upper Bound % SE 








i 

Norte 

877,349 

22.8 2.93 

19.3 

26.2 

15.0 

2 

Altos Norte 

802,614 

0.4 

1.95 

0.0 

3.2 

647.9 

3 

Altos Sur 

661,719 

2.7 

2.04 

0.0 

5.6 

104.8 

4 

Cienega 

503,973 

5.6 

1.94 

2.8 

8.4 

49.9 

5 

Sureste 

732,339 

43.4 

3.11 

39.9 

46.9 

8.1 

6 

Sur 

598,816 

48.5 

3.28 

44.9 

52.2 

7.5 

7 

Sierra de Amula 

379,810 

23.7 

2.94 

20.3 

27.1 

14.5 

8 

Costa Sur 

696,204 

62.4 

3.13 

58.9 

66.0 

5.7 

9 

Costa Norte 

524,076 

64.2 

3.09 

60.7 

67.7 

5.5 

10 

Sierra Ocidente 

831,069 

69.1 

3.26 

65.5 

72.7 

5.2 

11 

Valles 

627,122 

18.7 

2.91 

15.3 

22.1 

18.2 

12 

Centro 

589,920 

13.4 

2.90 

10.0 

16.8 

25.4 

Average 



31.8 

2.79 

28.4 

35.1 

10.5 

Total 


7,825,010 













i 

Mascota 

185,043 

79.9 3.51 

76.1 

83.6 

4.7 

2 

Mextlan 

63,196 

46.2 

2.67 

42.9 

49.4 

7.1 

3 

Atenguillo 

59,727 

49.5 

2.20 

46.6 

52.5 

6.0 

4 

Ayutla 

88,246 

48.7 

3.03 

45.2 

52.2 

7.1 

5 

Talpa de Allende 

197,824 

100.6 

3.86 

96.7 

104.5 

3.9 

6 

San Sebastian del Oeste 

110,759 

75.8 

3.26 

72.2 

79.4 

4.8 

7 

Cuautla 

42,778 

56.4 

2.30 

53.4 

59.5 

5.4 

8 

Guachingo 

83,495 

21.1 

2.96 

17.7 

24.5 

16.3 

Average 



69.1 

3.22 

66.5 

72.7 

5.2 

Total 


831,069 





















Watersheds 1 and 2 


8,419 


84.8 3.22 

81.2 

88.3 

4.2 

Forest 


654 


95.4 4.70 

91.0 

99.7 

4.5 








State-wide Inventory 


7,825,010 

31.7 

2.28 

28.7 

34.7 

9.5 

Spatial Model 


7,825,010 

31.0 






The regression method demonstrated a significant improve¬ 
ment over the synthetic estimator through the use of the spatial 
models to geographically reflect differences in the variable of 
interest. The accuracy of the estimates relies heavily on the qual¬ 
ity of the spatial models; the stronger the correlation between the 
spatial models and the variable of interest, the less potential bias. 
This correlation is, to some extent related to the spatial resolution 


of the spatial models. Models with a fine spatial resolution are 
expected to be more informative than those based on a coarser 
resolution. If the resolution of the spatial model does not match 
the resolution of the field data this could introduce random noise 
into the estimation process and result in biased estimates because 
the spatial models are not able to account for the local variability 
in the small areas. The regression estimators were also, for the 
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most part, more precise than the synthetic estimator. In estimat¬ 
ing the variances, no attempt was made to take into consideration 
the uncertainty associated with estimates from the spatial models. 
It is not uncommon to assume such estimates are error free, and 
therefore not account for their effects in the model and variance 
calculations. 

Our method of small area estimation may not be compatible 
with certain types of data inquiries. Recently, Reich et al. (2008a) 
developed species distribution maps of several important tree 
species using the topographic and climatic variability for the 
State. Accuracy assessments indicated the models overestimated 
species occurrences compared to the state-wide inventory. The 
disparity between the two estimates could be attributed to the 
methods used to develop the maps as well as the survey itself. 
First, the maps did not necessarily indicate where a tree species 
occurred, but rather where they had a predicted likelihood of 
being found. The climatic data may have been emphasizing the 
fundamental niche (Kearney and Porter 2004) of the tree species 
rather than their realized niche. Second, the state-wide inventory 
was not designed as a vegetation survey and given the diversity 
of the forests and the low sampling intensity associated with 
certain resource characteristics, it is not surprising that the mod¬ 
els overestimated species occurrences. If such models are used 
for small area estimation, one must be aware of the implications 
of the discrepancies between the state-wide inventory and the 
spatial models due to false-positives and false-negatives errors 
and how they may impact the estimation process. 

State and local agencies need valid and relatively precise data 
to account for estimates of important ecological indicators that 
are fundamental to support decision making processes for eco¬ 
system sustainability. Model building takes time and effort and it 
may not be possible to develop reliable models for every situa¬ 
tion. However, there may be a set of spatial models which are 
related to the variable of interest and could potentially be used to 
obtain improved estimates. The decision to use this approach 
would depend on the statistical validity of the estimates as well 
as their acceptance by the policymakers who intend to use them. 
The fundamental issue is whether or not the small geographical 
regions are individually unique and thereby require estimates 
based solely on their own separate data. If this is the case, then 
only direct estimates are appropriate, and a sample survey should 
be designed accordingly. It would therefore be logical for the 
state to either coordinate outside efforts examining other vari¬ 
ables, or alternative statistical methods to provide small area 
estimates for some of the more important indicator variables. 
This would ensure that efforts are not duplicated and are maxi¬ 
mally cost-effective. 

Conclusions 

In summary, this regression-based small-area estimation proce¬ 
dure is relatively computationally simple and provides a signifi¬ 
cant improvement over the often used synthetic method. This 
study provided evidence that IMRENAT can be used as a valid 
source of data. The small area estimation methodology presented 
in this study produces consistent estimates of forest stand struc¬ 


ture at multiple spatial scales. By applying regression models 
derived from the state level to more localized levels, this method 
allows the creation of small area estimates for many different 
geographical regions within the state. The resulting data are in¬ 
formative for decision making and planning efforts to expand 
assistance to land owners in managing their natural resources. 
We anticipate that this will improve planning among local agen¬ 
cies with limited resources. 
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